Distributed Reinforcement Learning based on α-domination Strategy for Multi-criteria Decision Making and its Application to Distributed Database Systems
Abstract
In distributed systems where agents cannot exchange information directly, we address the problem of deciding how each agent should hold a shared resource. To complete as many tasks as possible, greedy agents tend to hold resources for long periods. As a result, system performance decreases, because holding a resource competes with the processing of other agents' tasks. To acquire cooperative policies that avoid this competition, we formulate the resource-sharing problem as a multi-criteria decision-making problem with priority levels, incorporating domain knowledge into the reward. We propose a distributed reinforcement learning method that narrows the action space using the α-domination strategy, based on value functions for the objective and for cooperation. The proposed method is applied to distributed database systems, and simulation results show that it acquires cooperative policies and improves system throughput.
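The abstract does not define α-domination or say how the action space is narrowed, so the following is only a minimal sketch under stated assumptions. It assumes one formulation of α-domination used in multi-objective optimization, in which a value vector x α-dominates y when every weighted difference g_i = Σ_j α_ij (x_j − y_j) is non-negative and at least one is positive (maximization, with α_ii = 1). It also assumes each agent keeps two value estimates per action, one for the objective and one for cooperation. The function names, the α matrix, and the sample values are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def alpha_dominates(x: Sequence[float], y: Sequence[float],
                    alpha: Sequence[Sequence[float]]) -> bool:
    """Return True if value vector x alpha-dominates y (maximization).

    Assumed rule: all weighted differences are >= 0 and at least one is > 0.
    With alpha equal to the identity matrix this reduces to Pareto dominance.
    """
    n = len(x)
    g = [sum(alpha[i][j] * (x[j] - y[j]) for j in range(n)) for i in range(n)]
    return all(v >= 0 for v in g) and any(v > 0 for v in g)


def non_dominated_actions(q_values: Sequence[Sequence[float]],
                          alpha: Sequence[Sequence[float]]) -> List[int]:
    """Indices of actions whose value vectors no other action alpha-dominates."""
    return [
        a for a, qa in enumerate(q_values)
        if not any(alpha_dominates(qb, qa, alpha)
                   for b, qb in enumerate(q_values) if b != a)
    ]


# Hypothetical (objective, cooperation) value estimates for four actions.
q = [(1.0, 0.0), (0.9, 0.5), (0.0, 1.0), (0.2, 0.2)]

pareto = [[1.0, 0.0], [0.0, 1.0]]   # plain Pareto dominance
relaxed = [[1.0, 0.5], [0.5, 1.0]]  # allows a trade-off between the two criteria

pareto_set = non_dominated_actions(q, pareto)
relaxed_set = non_dominated_actions(q, relaxed)
```

In this sketch, a nonzero off-diagonal α lets a small loss in one criterion be outweighed by a larger gain in the other, so the candidate action set shrinks compared with plain Pareto filtering. An agent could then choose among the remaining actions, for example greedily on the objective value.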
Similar resources
An Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources
This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage employs a PID controller whose parameters are designed using sine cosine optimization (SCO...
Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it is first formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)
In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide a mobile robot through dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...
Distributed Reinforcement Learning in Multi-Agent Decision Systems
Decision problems can usually be solved using systems that implement different paradigms. These systems may be integrated into a single distributed system, with the expectation of obtaining a group performance more satisfactory than individual performances. Such a distributed system is what we call a Multi Agent Decision System (MADES), a special kind of Multi Agent System, that integrates sever...
Ranking Passive Seismic Control Systems by Their Effectiveness in Reducing Responses of High-Rise Buildings with Concrete Shear Walls Using Multiple-Criteria Decision Making
In recent decades, the dual systems of steel moment-resisting frames and RC shear walls have found extensive application as lateral load-resisting systems for high-rise structures in seismically active areas. This paper investigated the effectiveness of tuned mass damper (TMD), viscous damper, friction damper, and the lead-core rubber bearing in controlling the damage and seismic response of hi...
Publication date: 2004